Intonation processing for TTS using stylization and neural network learning method

نویسندگان

  • Jung-Chul Lee
  • Youngjik Lee
  • Sanghun Kim
  • Minsoo Hahn
چکیده

In this paper, we propose a new model for synthesizing fundamental frequency (F0) contours using a stylization and a neural network learning method. The F0 contour is described as the superposition of 4 layered features; global tune, word pitch bias, lexical tone, and the syllabic pitch pattern. We rstly stylize the F0 contour of speech material, and analyze stylized data by statistical approach according to grammatical attributes. We then construct a melodic table, and train lexical tone with a neural network. Finally we develop the intonation generation rules for TTS conversion. This model produces a good neutral declarative intonation, and there is little di erence between synthesized speech with original F0 contour and that with the rule generated contour when tested with our TD-PSOLA synthesizer[6][7].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intonation Processing for Tts Using Stylization and Neural Network Learing Method

In this paper, we propose a new model for synthesizing fundamental frequency (F0) contours using a stylization and a neural network learning method. The F0 contour is described as the superposition of 4 layered features; global tune, word pitch bias, lexical tone, and the syllabic pitch pattern. We rstly stylize the F0 contour of speech material, and analyze stylized data by statistical approac...

متن کامل

Comparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis

Chironomic stylization is the process of real-time modification of intonation contours (f0 and tempo) using drawing/writing gestures with a stylus on a graphic tablet. The question addressed in this research is whether hand-made intonation stylization could improve or degrade expressivity and overall quality, compared to statistical modeling of prosody. A system for expressive TTS in French bas...

متن کامل

An automatic intonation recognizer for the Polish language based on machine learning and expert knowledge

In the paper a new automatic intonation recognizer for the Polish language is presented. The recognizer design combines Machine Learning and expert knowledge techniques. Machine Learning is used in pitch stylization (Artificial Neural Network), speech alignment (external design based on Hidden Markov Model) and intonation decoding (Hidden Markov Model). Expert knowledge drives phonemization, sy...

متن کامل

Automatic modeling and implementation of intonation for the arabic language in TTS systems

This paper proposes a set of rules for the automatic generation of F0 contours for modern standard Arabic (MSA) affirmative and interrogative sentences. The objective is to finalize a model for the automatic processing of the intonative pattern in different TTS systems (e.g. synthesis by formants using Klatt synthesizer and synthesis by diphones, using a synthesizer based on PSOLA algorithm). T...

متن کامل

Automatic Analysis and Synthesis of Fujisaki’s Intonation Model for TTS

This paper deals with the automatic analysis and synthesis of intonation using Fujisaki’s model. We propose an analysis method which imposes strong linguistic constraints. This method produces good representations of the F0 contour when compared to other current methods which do not impose such constrains. Furthermore, this option limits the variability and is more predictable so it is the best...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996